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Description 

BACKGROUND OF INVENTION 

Reference to Related Applications 

Filed concurrently with this application are ap- 
plications entitled "Complex Pattern Recognition 
Method and System" (published as EP-A- 0 114 
248), and "Method for Distinguishing Between 
Complex Character Sets" (published as EP-A- 0 
135 615). 

Field of Invention 

This invention relates to a method and an ap- 
paratus for identifying patterns according to to the 
preambles of claim 1 and 13. This invention relates 
for example, to recognition of handwritten char- 
acters such as Chinese characters (i.e., Kanji). Spe- 
cifically, this application relates to identification of 
complex characters composed of elements, namely 
strokes, wherein strokes of distinguishable signifi- 
cance are subject to confusion. 

The recognition of complex characters has 
been pursued with limited success for many years. 
Kanji has been considered the greatest challenge 
because it is not easily adapted to keyboard input. 
There are for example approximately 10,000 distin- 
guishable characters in use in the Kanji system 
representing various words, phrases, concepts and 
in some instances syllables. 

Various recognition schemes have been re- 
ported for hand registered characters. The 
schemes are typically based on spatial and certain 
limited shape characteristics of elements such as 
strokes, a stroke being the locus and sequence of 
a continuous chain of related points created by 
substantially uninterrupted contact between a 
pattern-forming means and a pattern-accommodat- 
ing means, such as a pen and a tablet or any other 
movement-registering instrument or system. Prior 
art schemes are aimed at collecting and retaining a 
substantial amount of information which is pro- 
cessed in an attempt to distinguish the character 
from all other characters. 

Recognition of complex characters of the type 
of interest is made more difficult because there are 
no uniform definitions for the fundamental stroke 
types from which the characters are formed, and 
there is substantial variation in character formation, 
even by the same writer. Consequently there is a 
potential for confusion between differing strokes 
and between different characters. What is needed 
is a pattern recognition scheme which is capable of 
tolerating wide variations while accurately identify- 
ing patterns and specifically characters from 
groups of basic elements such as strokes. 



Description of the Prior Art 

A pattern recognition method and apparatus in 
accordance with the preambles of claims 1 and 12 

5 are known from Odaka et al., U.S. Patent No. 4, 
317, 109. In this reference^ eacTT stroke of - the 
handwritten input character is sampled on the basis 
of a three-point approximation. In detail, the x-y- 
coordinates of the initial point, the center point and 

10 the end point of each stroke are calculated, and a 
matrix of stroke differences between each stroke of 
the input character and each stroke of a selected 
reference character is calculated. In this matrix, the 
minimum value of each line is detected and 

15 flagged. The flagged values are summed up. This 
procedure is repeated for all the reference char- 
acters, resulting in a respective number of sums. 
The minimum sum is searched, and the reference 
character related there to is recognized to be the 

20 same as the input character. This pattern recogni- 
tion system is able to recognize a handwritten 
character even if said character is written with an 
arbitrary stroke order. 

Pattern recognition, Vol. 8, no. 2, April 1976, 

25 pages 87-98 is a review article on Chinese char- 
acter recognition. In this reference, particularly a 
set of 19 types of strokes is shown. Some types of 
strokes includes multiple possibilities arising from 
handwritting variations which are to be assimilated 

30 to a single stroke. Such variants do not represent 
true distinct reference strokes and do not form 
confusion groups. 

Asia Electronics Unions, Vol. 6, no. 2, October 
1 973, pages 10-14 discloses a classification of 

35 basic kanji characters into categories with one to 
40 characters per category. 

US-Patent No. 4, 040,009 discloses a recogni- 
tion system for opto-electrical recognizing of sim- 
ple alpha-numeric characters written on a paper. 

40 This system compares the unknown input pattern 
representing a complete character with a plurality 
of prepared reference or standard patterns. In order 
to recognize confusable characters not confusable 
strokes, i. e. character elements, when more than 

45 two reference patterns having a similarity to the 
unknown input pattern larger than a predetermined 
threshold level are detected, one or, more suitable 
feature comparators are selected from a plurality of 
feature comparators by a control signal corre- 

50 sponding to the reference patterns of the category 
thereof detected by a first discriminating circuit. 
Each of the features of the probable character 
patterns is compared with the portions of the un- 
known input pattern by the selected comparators. 

55 One character having a feature most similar to that 
of the input pattern is determined as the identified 
input pattern. In a certain sense the present inven- 
tion proceeds in the opposite direction, i. e. starting 
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with recognition of character elements or strokes. 

Further prior stroke recognition systems rel- 
evant to the present invention are represented by 
the following references: 

"On-Line Recognition of Handwritten Charac- 
ters", Hiroki Arakawa et al., Review of the Electrical 
Communication Laboratories, Vol. 26,~Nbs. 11-12, 
Nov.-Dec. 1978 describes a system in which a pair 
of linear waveforms is derived by recording in 
rectangular coordinates the movement of a hand- 
written point, approximating linear waveforms 
through a rectangular function expansion and then 
recognizing a character by utilizing a set of coeffi- 
cients of the rectangular function. 

IEEE Transactions on Electronic Computers, 
Dec. 1967, pp. 856860; "Japanese Patent Applica- 
tion No. 1977- 083733 entitled "On Line Recogni- 
tion Method of Handwritten Characters" filed July 
12, 1977; and U.S. Patent No. 4,173,753 to Chou 
entitled "Input System for Sino-Computer" repre- 
sent another general type of stroke recognition 
technique, namely, pattern matching. In Chou, 
strokes are recognized as elementary patterns in 
strings of elementary strokes. In the '733 reference, 
a spatial matching technique is described. Strokes 
of a character to be recognized are approximated 
by coordinate position, and deviations from stan- 
dard coordinate patterns are computed point by 
point and summed over the whole character to 
obtain decision criteria. Analysis of these types of 
schemes supports a conclusion that increasing the 
amount of information about a stroke does not 
necessarily lead to improved recognition accuracy. 
In fact, increasing the precision of stroke registra- 
tion increases the difficulty of pattern matching. On 
the other hand, decreasing the precision of stroke 
registration causes confusion among strokes of 
similar shape but differing significance. In either 
extreme, stroke recognition accuracy degrades. 

"On-Line Recognition of Hand-Written Charac- 
ters Utilizing Positional and Stroke Vector Se- 
quences", Pattern Recognition , Vol. 13, No. 3, p. 
191 (Permagon Press, 1981) is a reference which 
reports of an extended six company/ university 
joint effort to develop a stroke vector sequence 
character recognition system based on elemental 
stroke shapes derived from five percent to ten 
percent of a stroke length. A great deal of data is 
developed about a relatively small portion of a 
stroke. The proposed system is believed to be 
expensive and insufficiently accurate to be a prac- 
tical and commercial success. 

E. F. Yhap et al., "An On Line Chinese Char- 
acter Recognition System", IBM Journal of Re- 
search and Development , Voi7"25, No. 3, ~p. 187 
(May 1981) describes a handwritten Chinese char- 
acter recognition scheme in which a large number 
of parameters about a Chinese character are cate- 



gorized, generally in relation to positions within a 
field of registration. 

Crane et al., "A Technique for the Input of 
Handprinted Chinese Characters Based on Se- 

5 quential Stroke Recognition", Proceedings of Inter- 
national Computer Symposium 1977, Vol" One, p. 
246 (27-29 Dec. 1977, Taipei, Republic of China) is 
a survey article. It further describes a proposed 
character recognition technique suitable for essen- 

w tially real time processing. It is based on stroke 
. label sequence recognition where there is limited- 
precision stroke recognition. The paper is an early 
publication related to the present work and de- 
scribes preliminary conclusions of the present in- 

75 ventors. The present invention may be used in 
connection with techniques described in this and 
other prior works. 

Crane et al., U.S. Patent No. 4,040,010 issued 
August 2, 1977, describes a handwriting verification 

20 system in which a special pen produces signals 
representative of various parameters based on an- 
gularly resolved writing pressure for identifying a 
signature or other appropriate group of relatively 
simple characters or symbols. A signature is 

25 deemed to be a forgery according to the invention 
if the sum of the component variations of the 
detected signature deviates beyond some 
preselected threshold value established by a signa- 
ture template. 

30 Various other methods not to be confused with 

the present invention relate to the pictorial aspects, 
that is, spatial appearance, of a character. These 
and other schemes have been explored for many 
years, and substantial research efforts have been 

35 largely unsuccessful in providing a practical char- 
acter recognition system which can be used in an 
interactive, essentially real-time environment. 

SUMMARY OF THE INVENTION 

40 ' " ~ 

According to the present invention as defined 
by claims 1 and 13, a character recognition 
scheme employs identification of a handwritten 
stroke by grouping elements or stroke types into 

45 confusion groups based on criteria such as general 
complexity, radial direction, other stroke-shape re- 
lated criteria or related meaning. A stroke is as- 
signed an identity label upon comparison with such 
criteria stored in the system. The criteria may take 

50 the form of a preselected set of predefined tem- 
plates, the templates each comprising for example 
a set of related data points or positions along a 
template stroke. According to one embodiment of 
the invention as claimed, distance metric values of 

55 a stroke under examination normally generated by 
comparison of the stroke under examination with 
templates or model strokes are excluded from the 
normal recognition process when the template 
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strokes which would generate the second and 
greater distance metric values belong to the same 
confusion group as the template stroke producing 
the least distance metric value. A stroke cate- 
gorized as a member of a confusion group is 
employed in connection with other strokes iden- 
tified by stroke labels to specify the intended char- 
acter. 

In a related application published as (EP-A- 0 
114 248) entitled "Complex Pattern Recognition 
Method and System" a process is disclosed 
whereby strokes are matched with templates and 
assigned identity labels. As used herein a stroke 
refers to an individual element, namely the locus 
and sequence of a continuous chain of related 
points. 

According to an alternative embodiment of the 
invention as claimed, a stroke found to match a 
template of a particular confusion group is iden- 
tified for character recognition purposes by a con- 
fusion group label. Character identification is made 
by comparing identified stroke labels and/or confu- 
sion group labels against a dictionary made up of 
stroke labels and/or confusion group labels gen- 
erally arranged in sequence of occurrence in writ- 
ing. 

According to a still further embodiment of the 
invention, a confusion group template may be a 
locus of all points and all criteria for at least two 
elements of distinguishable significance. A confu- 
sion group label is assigned if all of the criteria 
developed for the element under examination are 
included within the confusion group template. 

According to a still further embodiment of the 
invention, a confusion group may be defined as the 
locus of all criteria for one element or the locus of 
all criteria for another element. A confusion group 
label is assigned to an element under examination 
which satisfies one such set of criteria. 

The stroke confusion grouping according to the 
invention permits the deliberate discarding of in- 
formation early in the recognition process allowing 
flexible yet potentially accurate identification of 
stroke types. The grouping of stroke types of dis- 
tinguishable significance which are likely to be con- 
fused avoids attempts at unmeaningful precision 
when recognition criteria are applied. 

In addition, the invention permits substantial 
flexibility in the use and development of a char- 
acter vocabulary in a system for recognition of 
complex characters such as Kanji. Strokes may be 
added and parameters of strokes may be changed 
to accommodate different types of complex char- 
acters, and selected stroke templates can be ig- 
nored during the recognition process. 

The invention eliminates the accumulation of 
much unneeded information about stroke form and 
minimizes the amount of processing required to 



recognize a stroke resulting in accurate stroke and 
character recognition at reasonable cost. The in- 
vention has numerous applications. It may be used 
in an interactive word processing and data entry 

5 system. Other purposes include use as a teaching 
tool to train individuals in stroke writing and pre- 
ferred stroke sequence and as a lexicon aid for 
looking up meanings of characters as well as pat- 
tern recognition applications in general. 

10 As used herein, a confusion group generally 

has reference to a grouping of elements which 
have distinguishable significance yet which are 
subject to confusion due to similarity in shape. A 
confusion group may in some instances be indistin- 

75 guishable from an individual stroke in treatment 
and use, representing for example a hybrid or 
average of strokes. The term may also mean to 
include a grouping of strokes found desirable not to 
identify separately. 

20 The invention is explained with reference to a 

specific embodiment in which curvature character- 
istics of a stroke are registered in the form of a set 
of vectors and specifically the polar coordinate 
representation of vectors at the position of each 

25 selected sample point along the stroke. The inven- 
tion, however, is not limited to the specific recogni- 
tion system described herein employing point for 
point matching. Moreover, the invention can be 
used in connection with other pattern recognition 

30 techniques. The invention will be better understood 
by reference to the following detailed description 
taken in connection with the accompanying draw- 
ings. 

35 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram of a written character 
processing system according to the invention. 

Fig. 2 is a block diagram of a processing 
■to device operative in accordance with the invention. 

Fig. 3 is a table of strokes illustrating confusion 
grouping according to the invention. 

Fig. 4 is a flow chart of a method according to 
the invention. 

45 

DESCRIPTION OF SPECIFIC EMBODIMENTS 

As used herein, & pattern is any two-dimen- 
sional shape or sequence of movement, such as a 
50 character. A template is any set of criteria about a 
pattern element. Specifically, a template may be a 
set of data points and criteria about data points, the 
locus of which define the shape of an element of a 
pattern, such as a stroke. An element is any dis- 
ss crete characteristic of a pattern. A stroke is a 
specific type of element. Strokes of different 
shapes are recognizable as having distinguishable 
significance. A character is any combination of 
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strokes or elements, such as a pattern used in a 
writing system. 

Referring to Fig. 1, there is shown a hand- 
written pattern identification system 10 which is 
used as a character recognition device. It includes 
a stroke registration means 12 which generates first 
signals representing first signal information describ- 
ing written strokes as hereinafter explained, the first 
signals being supplied by a first signal line 14 to a 
data processing means 16. The processing means 
is operative to compare second signals represent- 
ing second signal information describing template 
strokes to yield signals on signal line 20 identifying 
the character or specific portion of the written char- 
acter under examination. An output signal line 20 of 
the data processing means 16 is connected to a 
graphic translator 22 which in turn drives a char- 
acter output device 24 such as a visual display, a 
printer or other character processing device. 

It has been discovered that too much informa- 
tion about a character or stroke can be as det- 
rimental to identification as too little information. 
Yet it has also been discovered that at least initially 
substantial care must be taken to adequately regis- 
ter the information even if the entire amount of 
information is not used in the recognition process. 
It is an object of the invention to identify the 
element and pattern as early as possible and to 
discard as much information as possible to both 
increase accuracy and to avoid storage and pro- 
cessing of unneeded or redundant information. 

There are many possible embodiments of the 
invention. The following description sets forth but a 
few representative embodiments. 

The data processing means 16 includes means 
for comparing the first signal information, the input 
signals, with the second signal information, the 
stored signal, to obtain an identification of the 
stroke under examination, the third signal informa- 
tion, as hereinafter explained. In the specific em- 
bodiment, only curvature characteristics of the 
stroke under examination are specified, and gen- 
erally to a relatively high degree of precision, for 
use in identifying and labeling the stroke under 
examination. Other criteria, such as character com- 
plexity or stroke complexity may likewise be em- 
ployed. The second signal information describing 
the model strokes takes the form of stroke tem- 
plates. A stroke template is generally an ordered 
set of values at second data points generally cor- 
responding to sampled or interpolated positions of 
the first data points of the stroke under examina- 
tion. The values of the second data points may 
define relative position and permissible deviation 
values at each second data point. A confusion 
group template may substitute for several stroke 
templates. 

The processing means 16 may include a stroke 



processor 26, a template memory 28 (capable of 
storing stroke and confusion group templates), a 
character processor 30 and a character dictionary 
memory 32. The template memory 28 is respon- 

5 sive to a signal on a template address select line 
34 to provide second signals on a second signal 
line 18 to the stroke processor 26. The stroke 
processor 26 identifies the stroke represented by 
the first signal information based on the second 

w signal information. The first signal information ap- 
plied to the stroke processor 26 may include stroke 
positional information which is generally normalized 
relative to the starting point of each stroke, since 
stroke position relative to other strokes plays no 

75 part in the initial stroke recognition process of a 
preferred embodiment. A label signal is provided at 
the output of the stroke processor 26 on a label 
line 36 to the character processor 30. 

The character dictionary 32 is responsive to a 

20 signal on a character address select line 33 to 
provide signals on a character label line 31 to the 
character processor 30. The character processor 
30 is operative to identify one or more characters 
represented by a collection of stroke and/or confu- 

25 sion group label signals. Once the total set of 
unknown input signals generated from writing an 
entire character is recognized as possibly one or 
more characters, a character label signal is pro- 
vided at the output of the character processor 30 

30 on signal line 20, where the graphic translator 22 
recognizes the character or characters and the 
output device displays or permits the characters to 
be displayed. If two or more characters are to be 
displayed, the stroke registration device 12 may 

35 then be used by an operator to select or otherwise 
indicate which of the displayed characters is the 
intended character. In the event of ambiguous iden- 
tification or upon default, the intended character is 
then registered or otherwise used in a data pro- 

40 cessing sequence. 

Turning to Fig. 2, there is shown a portion of 
one data processing means 16 according to the 
invention. The apparatus is operative as follows: 
Raw data which may be in the form of a 

45 stream of x-y coordinate pairs generated as a func- 
tion of time are transformed by a preprocessor 46 
into a standard number of x-y coordinate pairs, for 
example thirty in accordance with a particular 
prototypical embodiment, spaced evenly along the 

so length of a stroke under examination. The precise 
values of the x and y coordinates are interpolated 
from the raw data as if the strokes were drawn at a 
constant rate. The interpolated x-y coordinate pairs 
are initially converted to a plurality of discrete, 

55 straight-line, standard-length vectors with specific 
angle values in polar coordinates specifying stroke 
curvature characteristics along the contour of the 
stroke under examination. The angle value is ex- 
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pressed as the arctangent of the chord between 
two data points in a reference plane. Alternatively 
the value at one data point could be the change in 
angle relative to the previous data point. The output 
of the preprocessor 46 is provided via data line 48 
to an array memory 50. The data line 48 is a bus 
which carries the position value signals. A write 
address line 52 may be coupled from the 
preprocessor 46 to the array memory 50 to provide 
the stroke position address of the corresponding 
stroke position data. The array memory 50 gen- 
erally has capacity to store at least all the contour 
information of an entire character. Each character 
is normally stored in stroke sequence. The array 
memory 50 in a particular embodiment may there- 
fore be a simple shift register which is operative on 
a first-in, first-out basis, thus eliminating a need for 
an explicit write address line. The data output of 
the array memory 50, in the form of the position 
value signals values, is provided to a distance 
metric calculator 54 via date line 56. The distance 
metric calculator 54 may generate read address 
information through a read address line 58 to the 
array memory 50. (A distance metric is a math- 
ematical value which represents a separation value 
between any two contours.) The purpose of the 
distance metric calculator 54 is to compute the 
closeness of the curvature characteristics of the 
stroke under examination to each one of a set of 
stroke templates. The stroke templates according 
to a specific embodiment of the invention are 
stored in a stroke template memory 28 in the form 
of a two column array of angular position values T 
and corresponding permissible angular deviation 
values <j|. 

Confusion group templates may take the form 
of stroke templates as stored. Confusion group 
templates may be superpositioned combinations of 
stroke templates or groups of distinguishable 
stroke templates each assigned a common label. 
Confusion group templates formed of a group of 
superpositioned stroke templates may retain the 
deviation values associated with each point, or new 
deviation values may be assigned to account for 
the locus of points generally falling between super- 
positioned stroke templates. 

The template data points, or second data 
points, are selected to correspond with the first 
data points, or data points of the stroke under 
examination, in both number and position. Figure 3 
gives a representative set of the template strokes 
empirically derived from the Kanji. The permissible 
angular deviation values for each stroke are a set 
of up to thirty positive values which may differ from 
point to point and from stroke to stroke. The values 
are based on experimentation and so are not given 
here. A stroke code has been arbitrarily assigned 
to each one of the template strokes except as 



hereinafter explained. The stroke code may be an 
ASCII character or the like which is suitable for 
data storage. 

The distance metric for each template is cal- 
5 culated in one of various ways. In a first alternative, 
the difference in angular displacement between the 
stroke under examination and the template is cal- 
culated by subtracting the angular value of each 
first data point from the angular value of the cor- 

w responding second data point to obtain an absolute 
value. The resultant difference value is then divided 
by the corresponding permissible angular deviation 
value to obtain a positive resultant value at each 
data point, and all of the positive resultant values 

75 are summed. In a second alternative, the resultant 
values are each squared and the new resultant 
values are summed. In the latter case, the mini- 
mum distance metric is analogous to a least 
squared difference representation of the separation 

20 between the first and second data points weighted 
by the permissible angular deviation value. 

The distance metric calculator 54 (Fig. 2) 
draws the position value and the related deviation 
value for each second data point of each template 

25 from the stroke template memory 28 through sec- 
ond signal lines 18, one line being for the position 
value and the other line being for the deviation 
value. A single metric value for each template is 
provided through a metric data line 60 to a metric 

30 memory table 62. The metric data is sorted in the 
metric memory table 62 in order from least to 
greatest. A write address line 64 conveys the initial 
address signals from the distance metric calculator 
to the table 62. A sorter 66 coupled to the table 62 

35 may be employed to rearrange the order of the 
data. In a first embodiment confusion group tem- 
plates are treated precisely as if they were stroke 
templates. A confusion group template may replace 
several stroke templates. In such an embodiment it 

40 may be unnecessary to employ a confusion group- 
ing processor 65 as hereinafter explained. 

Where the confusion group consists of several 
strokes which retain some individual characteris- 
tics, the sorted metric data output of the table 62 

45 may be coupled via a data line 69 to a confusion 
grouping processor 65. The confusion grouping 
processor 65 is operative in alternative ways, in 
addition to direct substitution for more than one 
stroke template, as above, depending on the par- 

50 ticular embodiment. According to a second em- 
bodiment, the confusion grouping processor 65 
flags or otherwise notes which stroke templates are 
members of the confusion group of the stroke 
template producing the least distance metric D| for 

55 the stroke under examination. Thereafter metrics 
corresponding to those stroke templates are ig- 
nored in the label selection process carried out by 
the metric selector circuitry. This technique permits 
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a unique stroke identity label to an individual mem- 
ber of a confusion group notwithstanding the fact 
that the member of the confusion group does not 
satisfy all recognition criteria when compared with 
other members of the confusion group. 

In a third embodiment, the confusion grouping 
processor 65 is operative to substitute a confusion 
group label for each template stroke which is a 
member of a confusion group. That is, the tem- 
plates of all characters within a confusion group are 
treated as if they represented a single character, 
yet each of the recognition criteria applicable to a 
particular stroke type is retained for the purpose of 
metric calculation. In this alternative, the confusion 
group label is substituted for the stroke label in the 
character memory 86 as hereinafter explained. 
(The character dictionary may include a vocabulary 
containing characters defined in terms of combina- 
tions of character labels and confusion group la- 
bels, including alternates of combinations.) 

The output of the metric memory table 62 in 
the first embodiment, or in the second or third 
embodiments, the output of the confusion group 
processor 65, is coupled to a recognition criteria 
calculator 70 and to a metric selector 72 through a 
data line 68. An address line 67 allows the metric 
selector 72 or the recognition criteria calculator 70 
to access the metrics stored in the metric memory 
table 62. 

In the recognition criteria calculator 70, confu- 
sion groups and separately defined strokes are 
treated the same in computing decision threshold 
values as hereinafter explained. The recognition 
criteria calculator 70 compares the least distance 
metric D, and at least the next least distance metric 
D i+1 of the template not belonging to the same 
confusion group as the template corresponding to 
the least distance metric Di against three specific 
threshold criteria. The threshold criteria are an up- 
per decision threshold Du, a lower decision thresh- 
old D L , and a difference decision threshold D D . The 
upper decision threshold D u and the lower decision 
threshold D L as well as the difference decision 
threshold D D are assigned values based on 
preselected tolerance levels. The lower decision 
threshold D L is the size of the largest acceptable 
distance metric between the template most closely 
matching the stroke under examination. The upper 
decision threshold Du is the smallest permissible 
distance metric for the template with the next least 
distance metric D i + 1 . The difference decision 
threshold D D is the sum of the least distance metric 
D| and a separation D d . These three decision cri- 
teria are compared with the least distance metric D| 
and next least distance metric D i + 1 , taking into 
account confusion grouping, in the first comparator 
74, second comparator 76, third comparator 77 and 
fourth comparator 78. First comparator 74 pro- 
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duces a positive indication if the least distance 
metric Di is less than the lower decision threshold 
D L . The second comparator 76 produces a positive 
logic indication if the next least distance metric 

5 D i+1 (not of the same confusion group) is greater 
than the upper decision threshold D u . The outputs 
of the first comparator 74 and second comparator 
76 are provided to an AND gate 80 so that the 
respective positive logic indications jointly indicate 

w stroke recognition. 

The third comparator 77 produces a positive 
logic indication if the value of the least distance 
metric D| is less than the upper decision threshold 
Dy. The fourth comparator 78 produces a positive 

75 logic indication if the value of the next least dis- 
tance metric D i+1 is greater than the difference 
decision threshold D D , where D D = Di + D d . The 
output of the third comparator 77 is provided to- 
gether with the output of the fourth comparator 78 

20 to an AND gate 81 so that the joint positive logic 
conditions indicate stroke recognition. The outputs 
of AND gates 80 and 81 are coupled to an OR gate 
82. If either of the input criteria is positively satis- 
fied, then the system 26 indicates it has recognized 

25 the stroke under examination. A stroke label mem- 
ory is activated by a TRUE output signal from OR 
gate 82. In response, the stroke label (or confusion 
group label) of the template corresponding to the 
least distance metric D, meeting the recognition 

30 criteria is loaded via line 85 into a character mem- 
ory 86 at the current stroke address, as indicated 
by write stroke address line 88. 

The character processor 30 includes the char- 
acter memory 86 and a character comparator 96 to 

35 which the character dictionary memory 32 is coup- 
led. Character data via a character data line 98 is 
communicated from the character memory 86 to 
the character comparator 96 in response to the 
presentation of a read stroke address command to 

40 the character memory 86. The character compara- 
tor 96 iterates through the character dictionary 
memory 32 by presenting a character address to a 
character dictionary memory 32 through a char- 
acter address line 100. Labels of corresponding 

45 strokes, or strokes and confusion groups, of the 
addressed character and the character under ex- 
amination are presented to the character compara- 
tor 96 through a character dictionary line 102 in 
response to the character address and the read 

so stroke address command. The character compara- 
tor 96 is operative to compare the labels from the 
character memory with the labels from the dic- 
tionary memory 32 to provide as its output a signal 
in the form of a character label whenever a char- 

55 acter is recognized. 

A particularly powerful tool of one inventive 
embodiment is the distance metric calculator 54 
and the stroke template set wherein the stroke 
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template set comprises both polar position informa- 
tion and permissible angular deviation information. 
This form of template eliminates the need for addi- 
tional templates to take into account stroke vari- 
ations which do not precisely satisfy the template 
criteria. In connection with this tool, confusion 
group templates and/or the confusion grouping pro- 
cessor 65 are also provided which groups template 
strokes of distinguishable significance in various 
ways in confusion groups for recognition purposes. 
The least distance metric may correspond to the 
actual stroke written, yet the distinction between 
the actual stroke written and other strokes in the 
same confusion group may be insufficient to distin- 
guish between other strokes in the same confusion 
group when identified by the distance metric tech- 
nique according to the invention. 

Referring to Fig. 3, there is shown a specific 
example of one confusion grouping scheme for 
Kanji. An unknown stroke may be recognized as 
being a member of a first confusion group consist- 
ing of strokes labeled A, E and U. When compared, 
the distance metrics between strokes labeled A, E 
and U are insufficiently distinguishable on the basis 
of all of the decision criteria. Under those cir- 
cumstances, the least distance metric is presumed 
to have the stroke label "A" corresponding to the 
unknown stroke even though all of the decision 
criteria are not met for strokes E and U. Alter- 
natively, the least distance metric is assigned the 
confusion group label "1". Alternatively, a separate 
confusion group template is provided which is the 
locus of all template strokes A, E and U and all 
points in between and which has the label "1 ". 

Fig. 3 identifies six confusion groups by label 
from a catalog of twenty-five distinguishable 
strokes. Each confusion group consists of two to 
five strokes. Five strokes belong to no confusion 
group. 

The character comparator 96 might recognize 
a string of confusion group and stroke identity 
labels as representative of more than one char- 
acter. In such an event and in accordance with one 
embodiment, more than one character may be 
presented in display, and the user may select from 
among the presented characters the particular 
character which was intended to be registered. The 
number of characters that presented in this manner 
due to stroke ambiguity is relatively small, permit- 
ting the user to rapidly identify and specify the 
character ultimately intended. 

Referring now to Fig. 4, there is a flow chart for 
a computer implementation of one embodiment of 
stroke recognition according to the invention. The 
distance metrics for all templates with respect to a 
particular stroke under examination are calculated 
(Step A). Thereafter, the distance metrics are sort- 
ed in order from least to greatest (Step B). The 



least distance metric corresponds to the most likely 
candidate for designating the label of the stroke 
under examination. However, stroke templates are 
flagged according to the confusion group of the 

5 least distance metric (step C). Thereupon, recogni- 
tion criteria are applied according to the invention 
to enhance the likelihood of accurate recognition. 
First, the least distance metric is compared with 
the lower decision threshold (Step D). If the indica- 

jo tion is that the least distance metric D, is less than 
the lower decision threshold D L , the next least 
distance metric D i+1 is compared with the upper 
decision threshold Du to see if the next least dis- 
tance metric D F+1 exceeds the upper decision 

15 threshold Dy (Step E). If the indication is positive, 
then according to this embodiment the system 
writes the stroke label to the character memory 
indicating recognition of the stroke (Step F). If the 
indication of either of the previous threshold level 

20 tests is negative, an alternative decision process is 
applied. First, the least distance metric Dj is com- 
pared with the upper decision threshold D u to see 
if the distance metric Di is lesser (Step G). If yes, 
then the next least distance metric D i+1 is com- 

25 pared with the difference decision threshold D D to 
see if the decision threshold D D is exceeded (Step 
H). The difference decision threshold D D is the sum 
of the least distance metric and the preselected 
separation D d . If the indication is positive, then the 

30 system indicates that it has recognized the stroke 
and writes the stroke label in the character memory 
(Step F). Should the decision threshold criteria not 
be met, a question mark is generated as the stroke 
label (Step J), and the system goes on to the next 

35 stroke. The process is repeated for each stroke 
(Steps K and L) until all strokes in a character are 
processed. Thereafter a character dictionary search 
(Step M) is conducted to identify the character. 
In one of the alternative embodiments wherein 

40 a stroke is recognized and is a member of a 
confusion group, the step (F) writes the confusion 
group label to the stroke memory. 

One of the advantages of the confusion group 
labeling technique is flexibility in vocabulary. Mul- 

45 tiple stroke templates are available for analysis 
purposes. Superpositioned templates may be in- 
corporated into a single template having the de- 
viation values adjusted to cover all generalizations 
of strokes or any grouping of strokes. 

so There is substantial advantage to the use of 
confusion group identity labels as a substitute for 
stroke labels in the character dictionary. With fewer 
equivalent strokes, the number of data bits required 
to store each label can be minimized, allowing for 

55 smaller, more efficient memory. 

The invention has now been explained with 
reference to specific embodiments. Other embodi- 
ments will be apparent to those of ordinary skill in 
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the art. It is therefore not intended that this inven- 
tion be limited except as indicated by the appen- 
ded claims. 

Claims 

1. A method for identifying patterns by means of 
a pattern registration, in the sense of data 
capture, means (12) and a data processing 
means (16), wherein each pattern comprises at 
least one element based upon relative move- 
ment between a pattern-forming means and a 
pattern-accommodating means of that registra- 
tion means (12), said registration means (12) 
being operative to generate first signal informa- 
tion represented by first signals representative 
of information about said element, said data 
processing means (16) being provided with 
prestored second signal information represent- 
ed by second signals, said second signal in- 
formation being defined as templates defining 
at least one reference element for comparison 
with said first signal information, said data pro- 
cessing means (16) being operative to gen- 
erate third signals representing third signal in- 
formation, said third signal information com- 
prising specification of identity labels for use in 
developing coded information specifying said 
pattern, said data processing means (16) being 
operative to compare said first signal informa- 
tion for best match with said second signal 
information to obtain an identity label corre- 
sponding to said first signal information, 

characterized in that: 

at least one of said templates is designated as 
a confusion group template representing at 
least two of such reference elements being 
provisionally treated as equivalent. 

2. The method according to claim 1 wherein said 
data processing means (16) compares said 
first signal information and said second signal 
information to yield as an identity label a spe- 
cific element label for the element under ex- 
amination. 

3. The method of claim 1, wherein said data 
processing mean (16) compares said first sig- 
nal information and said second signal informa- 
tion to yield an identity label specifying a con- 
fusion group. 

4. The method according to claim 1 or 2 or 3, 
wherein said first signal information comprise 
stroke information characterized in that said 
elements are strokes and a confusion group 



template comprises information about at least 
two strokes, characterized in that said data- 
processing-means-compare step comprises: 

(a) generating distance metric values repre- 
s sentative of closeness between said first 

signal information and said second signal 
information representative of stroke shape 
for each one of said templates; 

(b) sorting said distance metric values to 
io identify which of said templates yields the 

least of said distance metric values. 

5. In the method according to claim 4, wherein 
said sorting step includes ignoring decision 
15 criteria for distinguishing between template 

strokes where one template stroke is included 
in a confusion group of another template 
stroke. 

20 6. The method according to claim 4, character- 
ized in that: 

said processing means (16) calculates the 
distance between corresponding first signal in- 
formation and second signal information to ob- 

25 tain a set of difference values; 

said data processing means (16) divides 
each one of said difference values by a cor- 
responding permissible angular deviation value 
to obtain a set of resultant values; and 

30 said data processing means (16) sums all 

said resultant values to obtain said distance 
metric value. 

7. The method according to claim 4, character- 
35 ized in that said data processing means cal- 

culates the distance between corresponding 
first signal information and second signal in- 
formation to obtain a set of difference values; 
said data processing means (16) divides 
40 each one of said difference values by a cor- 

responding permissible angular deviation value 
to obtain a set of resultant values; 

said data processing means (16) squares 
each one of said resultant values to obtain a 
45 set of squared values; and 

said data processing means (16) sums all 
said squared values to obtain said distance 
metric value. 

so 8. The method according to claim 4, character- 
ized in that said data processing means (16) 
compares said distance metric values of said 
stroke under examination for each template 
with an upper decision threshold value (Dy), 

55 with a lower decision threshold value (D L ) and 

with a difference decision threshold value (D D ) 
to yield said identity label. 



17 



EP 0 114 250 B1 



18 



9. The method according to claim 8, character- 
ized in that said data processing means (16) 
assigns an identity label to said stroke under 
examination if the least distance metric value 
is less than the lower decision threshold value 
(D L ) and if the next least distance metric value 
is greater than the upper decision threshold 
value (Du). 

10. The method according to claim 8, character- 
ized in that said data processing means (16) 
assigns an identity label to said stroke under 
examination if the least distance metric value 
is less than the upper decision threshold value 
(Dj) and if the next least distance metric value 
is greater than the sum of the least distance 
metric value and the difference decision 
threshold value (D D ). 

11. The method according to claim 1 or 2 or 3, 
wherein said elements are strokes having de- 
finable shape and wherein at least one of said 
templates is a confusion group template con- 
sisting of at least two template strokes, each 
said template stroke comprising a set of posi- 
tion values specifying position along a stroke 
and a set of deviation values specifying per- 
missible deviation from said position values, 
characterized in that said data processing 
means (16) indicates a match with a confusion 
group template when said first signal informa- 
tion aligns with a specific template stroke 
which is included in a confusion group tem- 
plate. 

12. The method according to claim 1 or 2 or 3 
characterized in that said data processing 
means (16) indicates a match with a confusion 
group template whenever said first signal in- 
formation is included within a locus circum- 
scribing all of said pattern information of said 
confusion group template. 

13. An apparatus for identifying patterns by means 
of a pattern registration in the sense of data 
capture, means (12) and a data processing 
means (1 6), wherein each pattern comprises at 
least one element based upon relative move- 
ment between a pattern-forming means and a 
pattern-accommodating means of said registra- 
tion means (12), said registration means (12) 
being operative to generate signal information 
represented by first signals representative of 
information about said elements, said data pro- 
cessing means (16) being provided with 
prestored second signal information represent- 
ed by second signals, said data processing 
means (16) including means (28) for storing 



said second signal information as templates for 
comparison with said first signal information, 
each of said templates defining at least one 
reference element, said data processing 

5 means (16) being operative to generate third 

signals representing third signal information, 
said third signal information comprising speci- 
fication of identity labels for use in developing 
coded information specifying said pattern, said 

jo data processing means (16) including means 

(54, 62, 65, 66) for comparing said first signal 
information for best match with a subset of 
selected second signal information to obtain an 
identity label corresponding to said first signal 

75 information, 

characterized in that: 

at least one of said templates is designated as 
20 a confusion group template comprising infor- 

mation about at least two of such reference 
elements being provisionally treated as equiv- 
alent. 

25 14. The apparatus according to claim 13, wherein 
said comparing means (54,62,65,66) of said 
data processing means (16) is operative to 
compare said first signal information and said 
second signal information to yield as an iden- 

30 tity label a specific element label for the ele- 
ment under examination. 

15. The apparatus of claim 13, wherein said com- 
paring means (54,62,65,66) of said data pro- 
35 cessing means (16) is operative to compare 

said first signal information and said second 
signal information to yield an identity label 
specifying a confusion group. 

40 16. The apparatus according to claim 13, wherein 
said first signal information and said second 
signal information specify relative position 
along a stroke describing stroke shape, char- 
acterized in that said data processing means 

45 (1 6) includes: 

means for calculating the distance be- 
tween corresponding first signal information 
and second signal information to obtain a set 
of difference values; 

so means for dividing each one of said dif- 

ference values by a permissible angular de- 
viation value corresponding to each location of 
first signal information along a stroke to obtain 
a set of resultant values; and 

55 means for summing all said resultant val- 

ues to obtain a distance metric value. 

17. The apparatus according to claim 13, wherein 
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said first signal information and said second 
signal information specify relative position 
along a stroke describing stroke shape, char- 
acterized in that said processing means (16) 
includes: 

means for calculating the distance be- 
tween corresponding first signal information 
and second signal information to obtain a set 
of difference values; 

means for dividing each one of said dif- 
ference values by a permissible angular de- 
viation value corresponding to each location of 
first signal information along a stroke to obtain 
a set of resultant values; 

means for squaring each one of said resul- 
tant values to obtain a set of squared values; 
and 

means for summing all of said set of 
squared values to obtain said distance metric 
value. 

18. The apparatus according to claim 15, char- 
acterized in that said data processing means 
(1 6) includes means (70,72) for comparing said 
distance metric values of said stroke under 
examination for each template with an upper 
decision threshold value (Dj), with a lower 
decision threshold value (D L ) and with a dif- 
ference decision threshold value (D D ) to yield 
said identity label. 

19. The apparatus according to claim 18, char- 
acterized in that said data processing means 
(16) includes means (84) for assigning in iden- 
tity label to said stroke under examination if 
the least distance metric value is less than the 
lower decision threshold value (D L ) and if the 
next least distance metric value is greater than 
the upper decision threshold value {D u ). 

20. The apparatus according to claim 18, char- 
acterized in that said data processing means 
(16) assigns an identity label to said stroke 
under examination if the least distance metric 
value is less than the upper decision threshold 
value (Dy) and if the next least distance metric 
value is greater than the sum of the least 
distance metric value and the difference de- 
cision threshold value (Do). 

Revendications 

1. Procede pour identifier des formes a I'aide 
d'un moyen (12) d'enregistrement de formes 
dans le sens de saisie de donnees et d'un 
moyen de traitement de donnees (16) dans 
lequel chaque forme comprend au moins un 
element base sur le deplacement relatif entre 



un moyen de formation de formes et un 
moyen de prise en compte de formes de ce 
moyen d'enregistrement (12), ledit moyen 
d'enregistrement (12) etant fonctionnel pour 
5 produire des premieres informations de si- 

gnaux representees par des premiers signaux 
representatifs des informations concernant ledit 
element, ledit moyen de traitement de don- 
nees (16) etant prevu avec des secondes infor- 

w mations de signaux prememorisees represen- 

tees par des seconds signaux, lesdites secon- 
des informations de signaux etant definies cor- 
ne gabarit definissant au moins un element de 
reference pour une comparaison avec lesdites 

75 premieres informations de signaux, ledit 

moyen de traitement de donnees (16) etant 
fonctionnel pour produire des troisiemes si- 
gnaux representant des troisiemes informations 
de signaux, lesdits troisiemes informations de 

20 signaux comprenant la specification d'etiquet- 

tes d'identite utilisees dans le developpement 
d'informations codees specifiant ladite forme, 
ledit moyen de traitement de donnees (16) 
etant fonctionnel pour comparer lesdites pre- 

25 mieres informations de signaux en vue de leur 

meilleure concordance avec lesdites secondes 
informations de signaux afin d'obtenir une eti- 
quette d'identite correspondant auxdites pre- 
mieres informations de signaux, 

30 caractirise en ce que : 

au moins un desdits gabarits est congu 
comme un gabarit de groupe de confusions 
representant au moins deux de ces elements 
de reference traites provisoirement comme 

35 equivalents. 

2. Procede selon la revendication 1, dans lequel 
ledit moyen de traitement de donnees (16) 
compare lesdites premieres informations de 
■to signaux et lesdites secondes informations de 

signaux pour donner comme etiquette d'identi- 
te une etiquette d'element specifique pour 
I'element sous examen. 

45 3. Procede selon la revendication 1, dans lequel 
ledit moyen de traitement de donnees (16) 
compare lesdites premieres informations de 
signaux et lesdites secondes informations de 
signaux pour donner une etiquette d'identite 

50 specifiant un groupe de confusions. 

4. Procede selon la revendication 1 ou 2 ou 3, 
dans lequel lesdites premieres informations de 
signaux comprennent des informations de 
55 traits caracterise en ce que lesdits elements 

sont des traits et un gabarit de groupe de 
confusions comprend des informations concer- 
nant au moins deux traits, caracterise en ce 
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que ladite etape de comparaison du moyen de 
traitement de donnees comprend : 

(a) la production de valeurs metriques de 
distance representatives de la concordance 
entre lesdites premieres informations de si- 5 
gnaux et lesdites secondes informations de 
signaux representatives de la forme du trait 
pour chacun desdits gabarits ; 

(b) le tri desdites valeurs metriques de dis- 
tance pour identifier lequel des gabarits 10 
donne la moindre valeur desdites valeurs 
metriques de distance. 

5. Dans le precede conformement a la revendica- 

tion 4, dans lequel ladite etape de tri comporte 15 
des criteres de decision d'omission pour faire 
la distinction entre des traits de gabarit lors- 
qu'un trait de gabarit est inclus dans un grou- 
pe de confusions d'un autre trait de gabarit. 

20 

6. Precede selon la revendication 4, caracterise 
en ce que : 

ledit moyen de traitement (16) calcule la 
distance entre les premieres informations de 
signaux et des secondes informations de si- 25 
gnaux correspondantes pour obtenir un en- 
semble de valeurs de difference ; 

ledit moyen de traitement de donnees (16) 
divise chaque valeur desdites valeurs de diffe- 
rence par une valeur de deviation angulaire 30 
admissible pour obtenir un ensemble de va- 
leurs resultantes ; et 

ledit moyen de traitement de donnees (16) 
somme la totalite desdites valeurs resultantes 
pour obtenir ladite valeur metrique de distance. 35 

7. Precede selon la revendication 4, caracterise 
en ce que ledit moyen de traitement de don- 
nees calcule la distance entre les premieres 
informations de signaux et les secondes infor- 40 
mations de signaux correspondantes pour ob- 
tenir un ensemble de valeurs de difference ; 

ledit moyen de traitement de donnees (16) 
divise chaque valeur desdits valeurs de diffe- 
rence par une valeur de deviation angulaire 45 
admissible correspondante pour obtenir un en- 
semble de valeurs resultantes ; 

ledit moyen de traitement de donnees (16) 
eleve au carre chaque valeur desdites valeurs 
resultantes pour obtenir un ensemble de va- 50 
leurs au carre ; et 

ledit moyen de traitement de donnees (16) 
somme la totalite desdites valeurs au carre 
pour obtenir ladite valeur metrique de distance. 

55 

8. Precede selon la revendication 4, caracterise 
en ce que ledit moyen de traitement de don- 
nees (16) compare lesdites valeurs metriques 



de distance dudit trait sous examen pour cha- 
que gabarit a une valeur superieure de seuil 
de decision (Du), a une valeur inferieure de 
seuil de decision (D,) et a une valeur de diffe- 
rence de seuil de decision (D D ) pour produire 
ladite etiquette d'identite. 

9. Precede selon la revendication 8, caracterise 
en ce que ledit moyen de traitement de don- 
nees (16) affecte une etiquette d'identite audit 
trait sous examen si la valeur metrique de 
distance moindre est inferieure a la valeur infe- 
rieure de seuil de decision (D,) et si la prochai- 
ne valeur metrique de distance moindre est 
superieure a la valeur superieure du seuil de 
decision (D„). 

10. Precede selon la revendication 8, caracterise 
en ce que ledit moyen de traitement de don- 
nees (16) affecte une etiquette d'identite audit 
trait sous examen si la valeur metrique de 
distance moindre est inferieure a la valeur su- 
perieure du seuil de decision (D u ) et si la 
prochaine valeur metrique de distance moindre 
est superieure a la somme de la valeur metri- 
que de distance moindre et de la valeur de 
difference du seuil de decision (D D ). 

11. Precede selon la revendication 1, 2 ou 3, dans 
lequel lesdits elements sont des traits ayant 
des formes definissables et dans lequel au 
moins un desdits gabarits est un gabarit de 
groupe de confusions constitue d'au moins 
deux traits de gabarit, chacun desdits traits de 
gabarit comprenant un ensemble de valeurs de 
position specifiant la position le long d'un trait 
et un ensemble de valeurs de deviation admis- 
sible specifiant la deviation autorisable a partir 
desdites valeurs de position, caracterise en ce 
que ledit moyen de traitement de donnees (16) 
indique une concordance avec un gabarit de 
groupe de confusions lorsque lesdites premie- 
res informations de signaux s'alignent avec un 
segment de gabarit specifique qui est inclus 
dans un gabarit de groupe de confusions. 

12. Precede selon la revendication 1 ou 2 ou 3, 
caracterise en ce que le moyen de traitement 
de donnees (16) indique une concordance 
avec un gabarit de groupe de confusions cha- 
que fois que lesdites premieres informations 
de signaux sont incluses a I'interieur d'un lieu 
entourant la totalite desdites informations de 
forme dudit gabarit de groupe de confusions. 

13. Dispositif pour identifier des formes au moyen 
d'un moyen d'enregistrement de formes dans 
le sens de saisie de donnees (12) et d'un 
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moyen de traitement de donnees (16), dans 
lequel chaque forme comprend au moins un 
element base sur le deplacement relatif entre 
un moyen de formation de formes et un 
moyen de prise en compte de formes dudit 5 
moyen d'enregistrement (12), ledit moyen 
d'enregistrement (12) etant fonctionnel pour 
produire des informations de signaux represen- 
tees par des premiers signaux representatifs 
des informations en ce qui concerne lesdits w 
elements, ledit moyen de traitement de don- 
nees (16) etant alimente avec des secondes 
informations de signaux prememorisees repre- 
sentees par des seconds signaux, ledit moyen 
de traitement de donnees (16) comportant un 75 
moyen (28) pour stacker lesdites secondes 
informations de signaux comme gabarit pour 
une comparaison avec lesdites premieres in- 
formations de signaux, chacun desdits gabarits 
definissant au moins un element de reference, 20 
ledit moyen de traitement de donnees (16) 
etant fonctionnel pour produire des troisiemes 
signaux representant des troisiemes informa- 
tions de signaux, lesdites troisiemes informa- 
tions de signaux comprenant la specification 25 
d'etiquettes d'identite utilisees dans le deve- 
loppement d'informations codees specifiant la- 
dite forme, ledit moyen de traitement de don- 
nees (16) comportant des moyens (54, 62, 65, 
66) pour comparer lesdites premieres informa- 30 
tions de signaux en vue de leur meilleure 
concordance avec un sous-ensemble des se- 
condes informations de signaux selectionnees 
afin d'obtenir I'etiquette d'identite correspon- 
dent auxdites premieres informations de si- 35 
gnaux, 

caracterise en ce que : 

au moins un desdits gabarits est congu 
comme un gabarit de groupe de confusions 
comprenant des informations concernant au 40 
moins deux de ces elements de reference 
traites provisoirement comme equivalents. 

14. Dispositif selon la revendication 13, dans le- 
quel lesdits moyens de comparaison (54, 62, 45 
65, 66) dudit moyen de traitement de donnees 

(16) sont fonctionnels pour comparer lesdites 
premieres informations de signaux et lesdites 
secondes informations de signaux pour produi- 
re comme etiquette d'identite, une etiquette 50 
specifique de ['element pour I'element sous 
examen. 

15. Dispositif selon la revendication 13, dans le- 
quel lesdits moyens de comparaison (54, 62, 55 
65, 66) dudit moyen de traitement de donnees 

(16) sont fonctionnels pour comparer lesdites 
premieres informations de signaux et lesdites 



secondes informations de signaux pour donner 
une etiquette d'identite specifiant un groupe de 
confusions. 

16. Dispositif selon la revendication 13, dans le- 
quel lesdites premieres informations de si- 
gnaux et lesdites secondes informations de 
signaux specifient des positions relatives le 
long d'un trait decrivant une forme de trait, 
caracterise en ce que ledit moyen de traite- 
ment de donnees (16) comporte : 

un moyen pour calculer la distance entre 
des premieres informations de signaux et se- 
condes informations de signaux correspondan- 
tes pour obtenir un ensemble de valeurs de 
difference ; 

un moyen pour diviser chaque valeur des- 
dites valeurs de difference par une valeur de 
deviation angulaire admissible correspondant a 
chaque emplacement des premieres informa- 
tions de signaux le long d'un trait pour obtenir 
un ensemble de valeurs resultantes, et 

un moyen pour sommer la totalite desdites 
valeurs resultantes pour obtenir une valeur me- 
trique de distance. 

17. Dispositif selon la revendication 13, dans le- 
quel lesdites premieres informations de si- 
gnaux et lesdites secondes informations de 
signaux specifient une position relative le long 
d'un trait decrivant une forme de trait, caracte- 
rise' en ce que ledit moyen de traitement (16) 
comporte : 

un moyen pour calculer la distance entre 
des premieres informations de signaux et des 
secondes informations de signaux correspon- 
dantes pour obtenir un ensemble de valeurs 
de difference ; 

un moyen pour diviser chaque valeur des- 
dites valeurs de difference par une valeur de 
deviation angulaire admissible correspondant a 
chaque emplacement des premieres informa- 
tions de signaux le long d'un trait pour obtenir 
un ensemble de valeurs resultantes ; 

des moyens pour elever au carre chaque 
valeur desdites valeurs resultantes pour obtenir 
un ensemble de valeurs au carre, et 

un moyen pour sommer la totalite desdits 
ensembles des valeurs au carre pour obtenir 
ladite valeur metrique de distance. 

18. Dispositif selon la revendication 15, caracterise 
en ce que ledit moyen de traitement de don- 
nees (16) comporte un moyen (70, 72) pour 
comparer lesdites valeurs metriques de distan- 
ce dudit segment sous examen pour chaque 
gabarit a une valeur superieure de seuil de 
decision (D u ), a une valeur inferieure de seuil 
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de decision (D,) et a une valeur de difference 
de seuil de decision (D D ) pour donner ladite 
etiquette d'identite. 

19. Dispositif selon la revendication 18, caracterise 
en ce que ledit moyen de traitement de don- 
nees (16) comporte un moyen (84) pour affec- 
ter une etiquette d'identite audit trait sous exa- 
men si la valeur metrique de distance moindre 
est inferieure a la valeur inferieure de seuil de 
decision (D|) et si la prochaine valeur metrique 
de distance moindre est superieure a la valeur 
superieure de seuil de decision (D u ). 

20. Dispositif selon la revendication 1 8, caracterise 
en ce que ledit moyen de traitement de don- 
nees (16) affecte une etiquette d'identite audit 
segment sous examen si la valeur metrique de 
distance moindre est inferieure a la valeur su- 
perieure de seuil de decision (D u ) et si la 
prochaine valeur metrique de distance moindre 
est superieure a la somme de la valeur metri- 
que de distance moindre et de la valeur de 
difference de seuil de decision (D D ). 

Patentanspriiche 

1. Verfahren zur Identifizierung von Mustern mit- 
tels einer Musterregistrierungs-(im Sinne einer 
Dateneinfang-)vorrichtung (12) und einer Da- 
tenverarbeitungsvorrichtung (16), wobei jedes 
Muster zumindest ein Element basierend auf 
der relativen Bewegung zwischen einer must- 
erbildenden Vorrichtung und einer musteran- 
passenden Vorrichtung der Registrierungsvor- 
richtung (12) aufweist, wobei die Registrie- 
rungsvorrichtung (12) zur Erzeugung einer er- 
sten Signalinformation, dargestellt durch erste 
Signale, welche Information uber das Element 
darstellen, betreibbar ist, und die Datenverar- 
beitungsvorrichtung (16) mit einer vorgespei- 
cherten zweiten Signalinformation ausgestattet 
ist, welche durch zweite Signale dargestellt ist, 
wobei die zweite Signalinformation definiert ist 
als Schablone, definierend zumindest ein Be- 
zugselement fur den Vergleich mit der ersten 
Signalinformation, und die Datenverarbeitungs- 
vorrichtung (16) zur Erzeugung von dritten Si- 
gnalen darstellend eine dritte Signalinformation 
betreibbar ist, wobei die dritte Signalinforma- 
tion die Spezifizierung von Identitatsbezeich- 
nungen zur Verwendung bei der Entwicklung 
kodierter Information zur Spezifizierung des 
Musters aufweist, wobei die Datenverarbei- 
tungsvorrichtung (16) zum Vergleich der ersten 
Signalinformation fur die beste Ubereinstim- 
mung mit der zweiten Signalinformation zum 
Erhalten einer Identitatsbezeichnung entspre- 



chend der ersten Signalinformation betreibbar 
ist, 

dadurch gekennzeichnet, da/3 

5 

zumindest eine der Schablonen als eine Konfu- 
sionsgruppenschablone ausgewahlt ist, welche 
zumindest zwei derartige Bezugselemente dar- 
stellt, die vorlaufig als aquivalent behandelt 
w sind. 

2. Verfahren nach Anspruch 1, dadurch gekenn- 
zeichnet, daB die Datenverarbeitungsvorrich- 
tung (16) die erste Signalinformation und die 
75 zweite Signalinformation vergleicht, urn eine 

spezifische Elementsbezeichnung als eine 
Identitatsbezeichnung fur das zu prufende Ele- 
ment zu erhalten. 

20 3. Verfahren nach Anspruch 1 , dadurch gekenn- 
zeichnet, dafi die Datenverarbeitungsvorrich- 
tung (16) die erste Signalinformation und die 
zweite Signalinformation vergleicht, urn eine 
Identitatsbezeichnung zur Spezifizierung einer 

25 Konfusionsgruppe zu erhalten. 

4. Verfahren nach Anspruch 1 Oder 2 Oder 3, 
dadurch gekennzeichnet, da/3 die erste Si- 
gnalinformation Strichinformation aufweist, wo- 

30 bei die Elemente Striche darstellen, und eine 

Konfusionsgruppenschablone Information uber 
zumindest zwei Striche aufweist, wobei der 
Datenverarbeitungsvorrichtungs- 
Vergleichsschritt aufweist: 
35 (a) Erzeugen von metrischen Entfernungs- 

werten reprasentierend die Nahe zwischen 
der ersten Signalinformation und der zwei- 
ten Signalinformation reprasentierend die 
Strichform fur jede einzelne der Schablo- 
40 nen; 

(b) Sortieren der metrischen Entfernungs- 
werte zur Identifizierung, welche der Scha- 
blonen den mindesten der metrischen Ent- 
fernungswerte ergibt. 

45 

5. Verfahren nach Anspruch 4, dadurch gekenn- 
zeichnet, da/3 der Sortierschritt Ignorierungs- 
entscheidungskriterien fur das Unterscheiden 
zwischen Schablonenstrichen, bei denen ein 

so einzelner Schablonenstrich in einer Konfu- 

sionsgruppe eines weiteren Schablonenstri- 
ches enthalten ist. 

6. Verfahren nach Anspruch 4, dadurch gekenn- 
55 zeichnet, da/3: 

die Verarbeitungsvorrichtung (16) die Entfer- 
nung zwischen der entsprechenden ersten Si- 
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gnalinformation und der zweiten Signalinforma- 
tion zum Erhalten eines Satzes von Differenz- 
werten berechnet; 

die Datenverarbeitungsvorrichtung (16) jeden 5 
einzelnen der Differenzwerte durch einen ent- 
sprechenden erlaubten Winkelabweichungs- 
wert zum Erhalten eines Satzes resultierender 
Werte dividiert; und 

10 

die Datenverarbeitungsvorrichtung (16) samtli- 
che der resultierenden Werte zum Erhalten 
des metrischen Entfernungswertes addiert. 

7. Verfahren nach Anspruch 4, dadurch gekenn- 75 
zeichnet, da/3 die Datenverarbeitungsvorrich- 
tung die Entfernung zwischen der entsprechen- 
den ersten Signalinformation und der zweiten 
Signalinformation zum Erhalten eines Satzes 
von Differenzwerten berechnet; 20 

die Datenverarbeitungsvorrichtung jeden ein- 
zelnen der Differenzwerte durch einen entspre- 
chenden, erlaubten Winkelabweichungswert 
zum Erhalten eines Satzes von resultierenden 25 
Werten dividiert; 



tung (16) dem zu prufenden Strich eine Identi- 
tatsbezeichnung zuordnet, falls der mindeste 
metrische Entfernungswert kleiner ist als der 
obere Entscheidungsschwellenwert (Du), und 
falls der nachstmindeste metrische Abstands- 
wert gro/Jer ist als die Summe des mindesten 
metrischen Abstandswertes und des Differenz- 
entscheidungsschwellenwert.es (D D ). 

11. Verfahren nach Anspruch 1 oder 2 Oder 3, 
dadurch gekennzeichnet, dafi die Elemente 
Striche darstellen mit definierbarer Form und 
da/3 zumindest eine der Schablonen eine Kon- 
fusionsgruppenschablone bestehend aus zu- 
mindest zwei Schablonenstrichen darstellt, wo- 
bei jeder der Schablonenstriche einen Satz von 
Positionswerten zur Spezifizierung der Positio- 
nen entlang einem Strich und einen Satz von 
Abweichungswerten zur Spezifizierung der er- 
laubten Abweichung von den Positionswerten 
aufweist, wobei die Datenverarbeitungsvorrich- 
tung (16) eine Ubereinstimmung mit einer Kon- 
fusionsgruppenschablone anzeigt, wenn die er- 
ste Signalinformation mit einem in einer Konfu- 
sionsgruppenschablone enthaltenen speziellen 
Schablonenstrich ausgerichtet ist. 



die Datenverarbeitungsvorrichtung (16) jeden 
einzelnen der resultierenden Werte zum Erhal- 
ten eines Satzes von quadrierten Werten qua- 30 
driert; und 

die Datenverarbeitungsvorrichtung (16) samtli- 
che der quadrierten Werte zum Erhalten des 
metrischen Abstandswertes addiert. 35 



12. Verfahren nach Anspruch 1 oder 2 oder 3, 
dadurch gekennzeichnet, da/3 die Datenver- 
arbeitungsvorrichtung (16) eine Obereinstim- 
mung mit einer Konfusionsgruppenschablone 
dann anzeigt, wenn die erste Signalinformation 
innerhalb eines gemeinsamen Ortes enthalten 
ist, der samtliche der Musterinformationen der 
Konfusionsgruppenschablonen umschreibt. 



8. Verfahren nach Anspruch 4, dadurch gekenn- 
zeichnet, da/3 die Datenverarbeitungsvorrich- 
tung (16) die metrischen Entfernungswerte des 

zu prufenden Striches fur jede Schablone ver- 40 
gleicht mit einem oberen Entscheidungs- 
schwellenwert (Dj), mit einem unteren Ent- 
scheidungsschwellenwert (D L ) und mit einem 
Differenzentscheidungsschwellenwert (D D ) zum 
Erhalten der Identitatsbezeichnung. 45 

9. Verfahren nach Anspruch 8, dadurch gekenn- 
zeichnet, da/3 die Datenverarbeitungsvorrich- 
tung (16) eine Identitatsbezeichnung dem zu 
prufenden Strich zuordnet, falls der mindeste 50 
metrische Entfernungwert kleiner ist als der 
untere Entscheidungsschwellenwert (D L ), und 

falls der nachstmindeste metrische Abstands- 
wert gro/ier ist als der obere Entscheidungs- 
schwellenwert (Du). 55 

10. Verfahren nach Anspruch 8, dadurch gekenn- 
zeichnet, da/3 die Datenverarbeitungsvorrich- 



13. Vorrichtung zur Identifizierung von Mustern 
mittels einer Musterregistrierungs-(im Sinne ei- 
ner Dateneinfang-)vorrichtung (12) und einer 
Datenverarbeitungsvorrichtung (16), wobei je- 
des Muster zumindest ein Element basierend 
auf der relativen Bewegung zwischen einer 
musterbildenden Vorrichtung und einer muster- 
anpassenden Vorrichtung der Registrierungs- 
vorrichtung (12) aufweist, und die Registrie- 
rungsvorrichtung (12) zur Erzeugung einer er- 
sten Signalinformation, dargestellt durch erste 
Signale, welche Information uber das Element 
darstellen, betreibbar ist, die Datenverarbei- 
tungsvorrichtung (16) mit einer vorgespeicher- 
ten zweiten Signalinformation ausgestattet ist, 
welche durch zweite Signale dargestellt ist, 
und die Datenverarbeitungsvorrichtung (16) 
eine Vorrichtung (28) aufweist zum Speichern 
der zweiten Signalinformation als Schablonen 
fGr den Vergleich mit der ersten Signalinforma- 
tion, wobei jede der Schablonen zumindest ein 
Bezugselement definiert, und die Datenverar- 
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beitungsvorrichtung (16) zur Erzeugung von 
dritten Signalen darstellend eine dritte Signal- 
information, betreibbar ist, wobei die dritte Si- 
gnalinformation die Spezifizierung von Identi- 
tatsbezeichnungen zur Verwendung bei der 
Entwicklung kodierter Information zur Spezifi- 
zierung des Musters aufweist, und die Daten- 
verarbeitungsvorrichtung (16) eine Vorrichtung 
(54, 62, 65, 66) zum Vergleich der ersten Si- 
gnalinformation flir die beste Obereinstimmung 
mit einem Untersatz der ausgewahlten zweiten 
Signalinformation zum Erhalten einer Identi- 
tatsbezeichnung entsprechend der ersten Si- 
gnalinformation betreibbar ist, 

dadurch gekennzeichnet, da/3 

zumindest eine der Schablonen als eine Konfu- 
sionsgruppenschablone ausgewahit ist, welche 
Information liber zumindest zwei derartige Be- 
zugselemente enthalt, die vorlaufig als aquiva- 
lent behandelt sind. 

14. Vorrichtung nach Anspruch 13, dadurch ge- 
kennzeichnet, da/3 die Vergleichsvorrichtung 
(54, 62, 65, 66) der Datenverarbeitungsvorrich- 
tung (16) fur den Vergleich der ersten Signalin- 
formation und der zweiten Signalinformation 
betreibbar ist, urn eine spezifische Elementbe- 
zeichnung fur das zu prOfende Element als 
eine identitatsbezeichnung zu erhalten. 

15. Vorrichtung nach Anspruch 13, dadurch ge- 
kennzeichnet, da/3 die Vergleichsvorrichtung 
(54, 62, 65, 66) der Datenverarbeitungsvorrich- 
tung (16) fur den Vergleich der ersten Signalin- 
formation und der zweiten Signalinformation 
betreibbar ist, urn eine Identitatsbezeichnung 
spezifizierend eine Konfusionsgruppe zu erhal- 
ten. 

16. Vorrichtung nach Anspruch 13, dadurch ge- 
kennzeichnet, da/3 die erste Signalinformation 
und die zweite Signalinformation die relative 
Position entlang eines Striches beschreibend 
die Strichform spezifizieren, wobei die Daten- 
verarbeitungsvorrichtung (1 6) aufweist: 

eine Vorrichtung zum Berechnen der Entfer- 
nung zwischen der entsprechenden ersten Si- 
gnalinformation und der zweiten Signalinforma- 
tion zum Erhalten eines Satzes von Differenz- 
werten; 

eine Vorrichtung zum Dividieren jeden der Dif- 
ferenzwerte durch einen erlaubten Winkelab- 
weichungswert entsprechend jeder Lokalisie- 
rung der ersten Signalinformation entlang eines 



Striches zum Erhalten eines Satzes von resul- 
tierenden Werten; und 

eine Vorrichtung zum Addieren samtlicher re- 
5 sultierender Werte zum Erhalten eines metri- 

schen Entfernungswertes. 

17. Vorrichtung nach Anspruch 13, dadurch ge- 
kennzeichnet, da/3 die erste Signalinformation 
w und die zweite Signalinformation die relative 

Position entlang eines Striches beschreibend 
die Strichform spezifizieren, wobei die Verar- 
beitungsvorrichtung (16) aufweist: 

is eine Vorrichtung zum Berechnen der Entfer- 

nung zwischen der entsprechenden ersten Si- 
gnalinformation und der zweiten Signalinforma- 
tion zum Erhalten eines Satzes von Differenz- 
werten; 

20 

eine Vorrichtung zum Dividieren jeden der Dif- 
ferenzwerte durch einen erlaubten Winkelab- 
weichungswert entsprechend jeder Lokalisie- 
rung der ersten Signalinformation entlang eines 
25 Striches zum Erhalten eines Satzes von resul- 

tierenden Werten; 

eine Vorrichtung zum Quadrieren jeden der 
resultierenden Werte zum Erhalten eines Sat- 
30 zes von quadrierten Werten; und 

eine Vorrichtung zum Addieren von samtlichen 
des Satzes von quadrierten Werten zum Erhal- 
ten des metrischen Entfernungswertes. 

35 

1a Vorrichtung nach Anspruch 15, dadurch ge- 
kennzeichnet, da/3 die Datenverarbeitungsvor- 
richtung (16) eine Vorrichtung (70, 72) zum 
Vergleichen der metrischen Entfernungswerte 

40 des zu prufenden Striches fur jede Schablone 

mit einem oberen Entscheidungsschwellenwert 
(Du), mit einem unteren Entscheidungsschwel- 
lenwert (D L ), und mit einem Differenzentschei- 
dungsschwellenwert (D D ) zum Erhalten der 

45 Identitatsbezeichnung aufweist. 

19. Vorrichtung nach Anspruch 18, dadurch ge- 
kennzeichnet, da6 die Datenverarbeitungsvor- 
richtung (16) eine Vorrichtung (84) zum Zuord- 

50 nen einer Identitatsbezeichnung dem zu pru- 

fenden Strich aufweist, falls der mindeste me- 
trische Entfernungswert geringer ist als der 
untere Entscheidungsschwellenwert (D L ), und 
falls der nachstmindeste metrische Entfer- 

55 nungswert gro/3er ist als der obere Entschei- 

dungsschwellenwert (Du). 

20. Vorrichtung nach Anspruch 18, dadurch ge- 
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kennzeichnet, da/3 die Datenverarbeitungsvor- 
richtung (16) dem zu prufenden Strich eine 
Identitatsbezeichnung zuordnet, falls der min- 
deste metrische Entfernungswert kleiner ist als 
der obere Entscheidungsschwellenwert (Du), 5 
und falls der nachstmindeste metrische Entfer- 
nungswert groBer ist als die Summe des min- 
desten metrischen Entfernungswertes und des 
Differenzentscheidungsschwellenwertes (D D ). 
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